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Abstract It has remained an open question for some time whether, given a set of not necessarily 
binary (i.e. "nonbinary" ) trees T on a set of taxa X , it is possible to determine in time f(r) -poly(m) 
whether there exists a phylogenetic network that displays all the trees in T, where r refers to the 
reticulation number of the network and m = \X \ + \ T\. Here we show that this holds if one or both 
of the following conditions holds: (1) \T\ is bounded by a function of r; (2) the maximum degree of 
the nodes in T is bounded by a function of r. These sufficient conditions absorb and significantly 
extend known special cases, namely when all the trees in 7" are binary [26] or T contains exactly 
two nonbinary trees [18]. We believe this result is an important step towards settling the issue 
for an arbitrarily large and complex set of nonbinary trees. For completeness we show that the 
problem is certainly solveable in time 0(m^ r ^). 

Keywords Phylogenctics • Fixed Parameter Tractability • Directed Acyclic Graphs 
1 Introduction 

A rooted phylogenetic tree on a set of taxa X is a directed tree in which exactly one node has 
indegree zero (the root), all edges are directed away from the root, the leaves are bijectively labelled 
by X and there are no nodes with indegree and outdegree both equal to one. Rooted phylogenetic 
trees, henceforth trees, are used to model the evolution of the set X starting from a (distant) 
common ancestor, the root [241I91IT0]. Rooted phylogenetic networks, henceforth networks, are 
a generalisation of trees which allow a wider array of evolutionary phenomena to be modelled, 
specifically those phenomena which involve the convergence, rather than divergence, of lineages. 
For detailed background information on networks we refer the reader to [T2, 13, 14,25,20ll2"5]. 

A network on X is a directed acyclic graph with a unique node of indegree zero (the root) from 
which all other nodes in the graph can be reached by a directed path, the leaves are bijectively 
labelled by X and there are no nodes of indegree and outdegree one (see Figure [1]) . Of particular 
interest are nodes with indegree two or higher, called reticulations. The reticulation number r(N) 
of a network is the sum of the indegrees of the reticulation nodes, minus the total number of retic- 
ulation nodes. It is the reticulation nodes that allow us to simultaneously "embed" multiple trees 
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(evolutionary hypotheses) into the network, a classical biological motivation being the embedding 
of multiple discordant gene trees into a single species network [5U] . More formally, we say that a 




Fig. 1 A (binary) rooted phylogenetic network N on X = {a, . . . , /}. This network has three reticulations. 

network N on X displays a tree T on X if a subtree T' of N exists such that: (1) T' is obtained by, 
for each reticulation in N, deleting all but one of its incoming edges, and (2) T can be obtained 
from T" by contracting some subset of the edges of T" (see Figure [5J . If every internal node of T 
has outdegree two we say that T is binary. When we say that a tree is nonbinary we simply mean 
"not necessarily binary" . (A binary tree is thus also a nonbinary tree) . Note that in the case that 
T is nonbinary the notion "displays" permits that the image of T inside TV is more resolved than 
T itself. This is motivated by the fact that biologists often use nodes with outdegree 3 or higher 
to indicate uncertainty, rather than a hard topological constraint |131ll9j . On the other hand, if T 
is binary, "displays" allows no such freedom, since T is already completely resolved. 

In recent years there has been much attention for the following optimization problem, motivated 
by the desire in biology to postulate a "most parsimonious" network which can simultaneously 
explain a set of evolutionary hypotheses modelled as trees. Given a set T of trees on the same set of 
taxa X , construct a network N on X that displays each of the trees in T with as small reticulation 
number as possible. This question has stimulated a considerable amount of mathematical research, 
with most attention thus far going to the case when T consists of two binary trees. Even this 
restricted variant of the problem is NP-hard, APX-hard [I] and possibly not even in APX, having 
similar (in)approximability properties to the classical problem feedback vertex set on directed 
graphs [17] . Despite the worrying approximability news there has been considerable progress on the 
question of fixed parameter tractability (FPT) , where the parameter in question is the minimum 
reticulation number, denoted r(T) = r. (For background information on FPT, see [5Tl[8"llll]). For 
two binary trees a suite of different (but related) FPT algorithms and tools [MMII, have been 
developed, the theoretical state-of-the-art being 0(3.18 r n) where n = \X\ [57], and a quadratic 
kernel [3][2j[7]. For more than two trees, or when T contains nonbinary trees, there is much less 
known. In |18j it is shown that, if T contains two nonbinary trees, the problem is also FPT (also 
via a quadratic kernel), although there are considerably more technicalities involved than in the 
binary case. Very recently |26| showed how to construct a quadratic kernel for an arbitrary number 
of binary trees and [55] gave a simplified bounded-search FPT algorithm for two nonbinary trees. 
This begs the question: is the problem still FPT for an arbitrary number of nonbinary trees? 
Although we do not yet have a full answer to this, we have identified quite broad conditions 
on T under which an FPT algorithm is possible, which absorbs and extends the conditions on 
T posed by [T5] and [5S]. Specifically, a set of nonbinary trees T on the same set of taxa X is 
well-bounded if at least one of the following two conditions hold, where f(r) is used to denote a 
function that depends only on r and not on the size of the input: (1) there are at most f(r) trees 
in T\ (2) the maximum degree ranging over all nodes in all trees in T is at most f(r). Clearly, the 
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Fig. 2 Here we see that the network N from Figure [T] displays the tree T shown here on the right. The dotted 
linos denote the reticulation edges that should be deleted. Note that here the image of T inside N is more resolved 
than T itself. 

case solved by [H] implies wcll-boundcdncss because \T\ < 2, and the case solved by [55] implies 
wcll-boundcdncss both because the maximum degree of any node is 3 and because without loss 
of generality it can be assumed that |T| < 2 r when all the trees in T are binary. That is, sets 
of binary trees obey both the possibilities for well-boundcdncss, despite only one being necessary, 
and this gives clues as to the comparative tractability of the binary case. Note that, when trees 
are permitted to be nonbinary, there is no obvious f(r) upper-bound on the size of T. 

In this article we give a constructive bounded-search algorithm which is FPT whenever T 
is well-bounded. We prove this by extending a related FPT result from [TS]. In that article the 
input is a set of clusters, where a cluster is simply a subset of X. We say that a tree T on X 
represents a cluster C C X whenever T contain an edge (u, v) such that the set of taxa reachable 
in T by directed paths starting from v, is equal to C . A network N on X represents a cluster 
C whenever there exists some tree T on X with the following properties: (1) N displays T and 
(2) T represents C. Given a set T of trees on X, it is natural to define the set Cl(T) as the 
set of all clusters represented by some tree in T. In [Jj)] it is shown that computing a network 
N with minimum reticulation number that represents a set of clusters C, is FPT, again using 
reticulation number as the parameter. The question immediately arises: what if we apply the 
result from |15j taking C = Cl(T)? Unfortunately, there are cases when the optimum under the 
cluster model can be strictly lower than the optimum under the tree model |25j . which stems 
from the fact that a network N might represent all the clusters in Cl{T) but not display all the 
trees in 7". However, it is tempting to ask whether the bounded-search algorithm given in |15) can 
be adapted by replacing intermediate tests of the form "does N represent Cl{T)?" with "does 
N display all the trees in T?". Here we show that in many cases the answer to this question is 
yes. However, there seem to be some pathological cases where the stronger topological demands 
of the tree model cannot easily be captured by the cluster model. These pathological cases are 
excluded by our definition of well-boundcdness, and the major open question emerging from this 
paper is whether these pathological cases are genuinely more difficult than the well-bounded case. 
We suspect that they can be overcome, because given a parameter value r and an arbitrarily large 
set of trees T with arbitrarily large maximum degree, the parameter r does impose quite severe 
restraints on the topology of the trees in T and the location of taxa in an optimum network N. We 
discuss these possibilities at the end of the article. Finally, for completeness we also give a simple 
non-FPT algorithm which, for fixed r, determines in polynomial-time whether r(7~) < r (and if so 
constructs an appropriate network), establishing that even inputs that are not well-bounded can 
be solved in polynomial time. 

Given the very close relationship between this article and [TS] we have decided not to repeat all 
algorithms from that article. For this reason it is necessary to read 15 before, or in parallel, with 
this article. The algorithmic changes are small; we only have to adapt two steps in a much larger 
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algorithmic procedure. Furthermore, it is relatively straightforward to argue that this adapted 
algorithm is correct and definitely constructs a network TV that displays all the trees in T such 
that r(N) = r(T). However, demonstrating that the running time is FPT requires much more 
work, and is the focus of this article. We are forced to deal with the aforementioned pathological 
situations by exhaustive guessing, and well-boundedness guarantees that the branching in the 
search tree caused by this pessimistic guessing does not spiral out of control. 



2 Preliminaries 

Some of the basic definitions, e.g. phylogenetic tree and phylogenetic network, have already been 
given in the introduction. In this section we will introduce several other definitions that will be 
used in the rest of the article. 

A network is said to be binary if every reticulation node has indegree 2 and outdegree 1 and 
every other interior node has outdegree 2. A (binary) refinement of a tree T is any (binary) tree 
T such that C7(T) C Cl{T'). (Note that every tree is a refinement of itself). 

Given a set of taxa X, we say that two clusters G\,Ci C X arc compatible if cither C\ C C2 
or C\ 2 C2 or C\ fl C2 = 0, and incompatible otherwise. We say that a set of taxa S C X is 
compatible with a cluster set C if every cluster C £ C is compatible with S, and incompatible 
otherwise. We say that a set S C X is an ST-set with respect to a set of clusters C, if S is 
compatible with C and the clusters of C\S are pairwise compatible, where C\S is defined as the 
set of clusters {C n S\C £ C}. An ST-set S is said to be maximal if there is no ST-set S' with 
S C 5". Given two taxa x, y £ X, we write x — if and only if every non-singleton cluster in C 
containing x, also contains y. 

An r-reticulation generator is defined as a directed acyclic multigraph, which has a single node 
of indegree and outdegree 1, precisely r reticulation nodes, and apart from that only nodes of 
indegree 1 and outdegree 2 |15j . The sides of a r-reticulation generator are defined as the union of 
its edges (the edge sides) and its reticulation nodes of outdegree (the node sides). Adding a set 
of taxa X to an edge side (u, v) of an r-reticulation generator consists of subdividing (u, v) into a 
path of \X\ internal nodes and, for each such internal node Wi, adding a new leaf w[ along with 
an edge (wi, w^), and labeling w[ with some taxon from X in such a way that X bijectively labels 
the new leaves. On the other hand, adding a taxon I to a node side v consists of adding a new leaf 
y along with an edge (v, y) and labeling y with I, Given a set of taxa X, the set M r x is defined as 
the set of all networks that can be constructed by choosing some r-reticulation generator G and 
then adding taxa to the sides of G as described above such that each taxon of X appears exactly 
once in the resulting network N and no multi-edge is present in N. (Note that a "fake root" , i.e. 
a single node with indegree and outdegree 1, will still be present in A, but this can simply be 
deleted along with its incident edge.) The resulting network N is said to be a completion of G on 
X while G is said to be the generator underlying N. 

Given a network TV, we say that a switching Xjv of A is obtained by, for each reticulation node, 
deleting all but one of its incoming edges. (The red subtree shown in Figure [5] is a switching). The 
definition of display given in the introduction is thus equivalent to: a network N on X displays 
a tree T on X if N has some switching T/v such that T can be obtained from T/v by contracting 
some subset of the edges of T/v. (In which case we say that T/v corresponds to T). For a set of trees 
T, the hybridization number of T is defined as the minimum reticulation number of A, ranging 
over all networks N that display all the trees in T. For consistency with [T5] we henceforth say 
reticulation number of T, denoted r(T), instead of hybridization number. 

We note that the value r(T) does not change if, in the definition above, we restrict our attention 
to binary networks N. This follows because there is a simple construction described in [251 Lemma 
2] which transforms a network N that displays all the trees in T into a binary network N' that 
displays all the trees in T, such that r(N') = r(N). For this reason we can restrict our attention 
to binary networks. Note that, for a binary network N and a tree T, the statement "A displays 
T" is equivalent to the statement "A displays a binary refinement of T" . 
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3 Minimizing the reticulation number of well-bounded sets of trees is fixed 
parameter tractable 

Henceforth, unless otherwise stated, the parameter r is obtained from the question "Is r(T) <r?". 
Clearly an algorithm to solve this problem can be used to solve the corresponding optimization 
problem i.e. by solving the decision problem for increasing values of r until r(7~) is reached. 

We begin by repeating the definition of well-boundedness already mentioned in the introduction. 

Definition 1 A set of nonbinary trees T on the same set of taxa X is well-bounded if at least 
one of the following two conditions hold: (1) there are at most f(r) trees in T; (2) the maximum 
degree ranging over all nodes in all trees in T is at most f(r). 

Note that if all the trees in T are binary then T is trivially well-bounded thanks to condition 
(2). However, without loss of generality we can actually also assume that when all trees in T are 
binary condition (1) holds. This is because a (binary) network with r reticulations can display up 
to 2 r binary trees. It is therefore pointless to try constructing a network with r reticulations if 
\T\ > 2 r . This places a natural f(r) upper bound on |T| in the binary case. 

The next result shows that claimed solutions can be efficiently verified. Our main algorithm 
will make heavy use of this fact to prune the search space. 

Proposition 1 Given a binary network TV with reticulation number r and a set of trees T on 
X , checking whether TV displays all the trees in T can be done in time f(r) ■ poly{m), where 
m = |#| + |T|. 

Proof. Since TV is binary, the set T(TV) of binary trees displayed by TV has cardinality at most 
2 r . Each tree in T and T(TV) contains at most 2(n — 1) clusters, where n — \X\. To determine 
if a tree T is a refinement of a tree T' we need only check that Cl(T') C Cl(T) which can be 
done in poly{n) time. In total therefore at most \T\ ■ |7~(TV)| ■ poly(n) time is required, which is 
f(r)-poly(m). □ 

Note that, since |T| can be exponential in \X\ = n (see the Appendix), the result docs not 
hold if m = \X\. This motivates our choice of m = \X | + \T\ as our measure of input size. 

Before proceeding, note that a binary network TV on X with r reticulations can represent at 
most 2 r+1 (n — 1) clusters. This follows from the proof of Proposition [1] If TV displays all the trees 
in T then it also represents all the clusters in Cl(T). Hence, if \Cl(T)\ > 2 r+1 (n — 1) we can 
immediately conclude that TV does not display all the trees in T, we call this the cluster bound. 
We may therefore henceforth assume that |C7(T)| < 2 r+1 (n — 1), which is f(r).poly(n). In any 
case, \Cl(T)\ is at most poly(m), because \Cl(T)\ < |T| • 2(n — 1). 

Let T be a set of nonbinary trees on X and let S be the set of maximal ST-sets of Cl(T). 
Given an ST-set S G 5, reducing S in T denotes the operation of, for each tree T in T . adding a 
single leaf with a new label s (the same in all trees of T) as a child node of lca^S), deleting all 
labels of S in T and finally applying in an arbitrary order the following steps until no more can be 
applied: (a) deleting all nodes with outdegree-0 that are not labelled by a taxon; (b) suppressing 
any node with indcgrcc-1 and outdcgrcc-1. We say that T is ST-collapsed if all maximal ST-sets 
of Cl{T) have size ]fl 

We denote with the term ST-collapsing the operation of reducing in a set of trees T all maximal 
ST-sets of Cl(l~). This is always possible because maximal ST-sets are disjoint (and unique) [161 
Corollary 4]. The maximal ST-sets can be computed in time poly(\X\, \Cl(T)\) [IB] , from which 
it can be seen that ST-collapsing all the trees in T can be performed in time poly(m). Note 



1 This is consistent with the definition given in |15| : a set of clusters C on X is ST-collapsed if every maximal 
ST-set of C has size 1. 
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that the set of trees T' obtained by ST-collapsing T, and the associated cluster set Cl(T') are 
ST-collapsecH. 

Lemma 1 Let T be a set of trees on X , and let T' be the set of trees obtained by ST-collapsing 
T . Then any network N' that displays all the trees in T 1 can be transformed into a network N 
that displays all the trees in T such that r(N) = r(N'), in polyijn) time. 

Proof. Let S = {Si, . . . , Si} be the set of maximal ST-sets of Cl(T). For each Sj € S we replace 
the dummy taxon Sj in N' with a binary tree on taxon set Sj that represents the set of clusters 
Cl{T)\Sj. The obtained network N obviously displays all the trees in T. By Corollary 4 of [16], 
maximal ST-sets are disjoint and |<S| is at most n. Since Cl(T) contains at most poly(m) clusters, 
the entire transformation can be performed in poly(m) time. □ 

Lemma 2 Given a set of trees T on X , let N be a network displaying all the trees in T. Let S 
be a non-singleton ST-set with respect to Cl(T). Then there exists a network N' displaying all the 
trees in T such that r(N') < r(N), S is under a cut-edge in N' and for each ST-set S' such that 
S' n S = and S' is under a cut-edge in N, S' is also under a cut-edge in N' . 

The proof of the lemma can be found in the Appendix. The following corollary stems from the 
fact that maximal ST-sets are disjoint: 

Corollary 1 Let N be a network displaying all the trees in T. There exists a network N' displaying 
all the trees in T such that r(N') < r(N) and all maximal ST-sets (with respect to Cl(T)) are 
below cut-edges. 

Lemma 3 Let T be a set of trees on X , and let T' be the set of trees obtained by ST-collapsing 
T . Then r(T') = r(T) and optimal solutions for T' can be converted into optimal solutions for T 
in time poly{m). 

Proof. This easily follows from Lemma [1] and Corollary [T] Indeed, the former result ensures that 
r(T) < r(T') while the latter that r(T) > r(V). □ 

Lemma [3] shows that, without loss of generality, we can restrict our attention to ST-collapsed 
sets of trees. The next lemma ensures that we can consequently narrow our search to the networks 
inA/X (T) . 

Lemma 4 Let T be an ST-collapsed set of trees on X , such that r(T) > 1. Then there exists a 
network N in J^x such that N displays all the trees in T . 

Proof. Let N be any binary network with reticulation number r(T) such that N displays all the 
trees in T . To prove the result, we need to prove that applying the reverse of the transformation 
described in Definition 4 of [T5] to TV will give some r(T)-reticulation generator G. The proof is 
the same of that of Lemma 4 (extended) of [TS] . (This follows because that lemma only considers 
the topology of N and a sufficient pre-condition for the lemma to hold is that N represents an 
ST-collapsed set of clusters, which is certainly the case here because N satisfies the stronger 
requirement of displaying all the trees in 7~.) □ 

We call a side of a generator G long, short or empty if it is allowed to receive respectively > 
2 taxa, 1 taxon or taxa. We call a set of side guesses for a generator G, denoted by Sg, an 
assignment of type (empty, short, or long) to each side of G. A completion N of G on X respects a 
set of side guesses Sg if long, short or empty sides in Sq respectively received > 2 taxa, 1 taxon or 
taxa in N. A pair (67, Sg) is said to be side-minimal w.r.t. T and r if there exists a network N 
such that (1) r(N) = r (2) N displays all the trees in T (3) N is a completion of G on X respecting 
Sg and (4) N has, amongst all binary networks displaying all the trees in T, a minimum number of 
long sides, and (to further break ties) amongst those networks it has a minimum number of short 



2 Essentially, ST-collapsing Cl{T) is the operation of identifying and reducing all the "maximal common pendant 
subtrees" of T, see 12211 1GI for a more detailed discussion of this. 
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sides. We define an incomplete network as a generator G, a set of side guesses Sg, a set of finished 
sides (i.e. those sides to which we are not allowed to add taxa anymore), a set of future sides (i.e. 
short and long sides on which no taxa has been placed yet) and at most one active side (i.e. a long 
side not yet declared as finished to which we have already allocated at least one taxon). A valid 
completion of an incomplete network is an assignment of the unallocated taxa to the future sides 
and (possibly) to the active side, that respects Sq and such that the resulting network displays 
all the trees in T . We are now ready to give the main lemma of this article. It shows that, once 
we have started on a new active side s, we know how to correctly continue adding taxa to it, and 
when to declare that it is finished. 

Lemma 5 Let T be a well-bounded ST- collapsed set of trees on X and let r be the first integer 
such that a network with reticulation number r displaying all the trees in T exists. Let N be an 
incomplete network such that its underlying r -reticulation generator G and set of side guesses Sq 
are such that (G,Sg) is side-minimal w.r.t. T and r, and let s be an active side of N. Then, if a 
valid completion for N exists, Algorithm]]] computes a set of (incomplete) networks M such that 
this set contains at least one network for which a valid completion exists in time f(r) ■ poly(m), 
where m = \X\ + \T\. 



Algorithm 1: addOnSidc*(A, s) 

15 if N(l, s) does not display each tree in T restricted to X{N) U {/} then declare s as finished and return N . If 
Xi is currently the only taxon on side s then return N(l, s). If all trees are safe w.r.t. s then return N(l, s). 
Otherwise, guess i.e. return both N(l, s) and N where s is declared as finished. 

53 if AT* (2, s) does not display each tree in T restricted to X(N) U {/} then guess either to end the side or to 
put I somewhere in U. Otherwise, if all trees are safe w.r.t. s then return N(l, s). Otherwise (i.e. at least one 
tree is unsafe w.r.t. s) guess between (1) ending the side, (2) returning N(l, s) or (3) putting I somewhere in 
U. 



Proof. Note that AlgorithmQ] i.e. Algorithm addOnSidc*, coincides with Algorithm 1 of [T5] but 
for line 15 and line 53. Thus, we only detail the modified lines in the pseudocode. We stress here 
that reading |15j is a prerequisite for the comprehension of this and subsequent lemmas. 

Most of the proof of this lemma coincides with the proof of Lemma 3 of [15] . This is true because 
a network displaying a set of trees T always represents the set of clusters Cl(T). (Unfortunately 
the opposite direction is not always true |25j). Hence, whenever the original algorithm rejects a 
candidate solution because it does (or will not be able to) represent the clusters Cl(T), we may 
conclude that this candidate solution certainly did not (or will not be able to) display all the trees 
in T, and thus also reject it from consideration. Note that Lemma 3 of |15j has been implicitly 
extended to also apply to ST-collapsed cluster sets and r-reticulation generators in Section 4 of 
[15) . by extending Propositions 1 and 2, and Observation 5. 

Only two parts of the proof of Lemma 3 of [15] do not work for ST-collapsed sets of trees. 
Having modified the original algorithm to obtain Algorithm Q] we now detail how to also modify 
the relevant parts of the original proof. 

Case U — 0. The only part of the proof of Lemma 3 of [TS] that does not hold here concerns 
the situation encountered when the original line 15 is reached: \L'\ = 1, B(l) = 0, and N(l, s) does 
represent C restricted to X(N) U {I}. At this point the new line 15 applies. 

It has been proven in Lemma 3 of |15j that in any valid completion of N there can be no taxon 
I' 7^ I directly above Xi on s. So all valid completions terminate the side at Xi or have I above Xi. 
We will perform several tests, one at a time, and we only resort to returning both solutions (i.e. 
branching) if the final part of line 15 is reached. 

The first test is: if N(l, s) does not display each tree in T restricted to X(N) U {/} then declare 
s as finished and return N (see Algorithm 1, line 15, first sentence). If N(l,s) does not display 
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each tree in T restricted to X(N) U {1} then it is not possible to construct a valid completion from 
N(l, s). Hence the only option that remains in this case is to terminate the side. 

The second test (Algorithm 1, line 15, second sentence) is simple. If Xi is currently the only 
taxon on side s, then we only have to return N(l,s). This is correct because of the assumption 
that s is a long side (i.e. has at least two taxa) combined with the earlier observation that the 
only taxon that can appear above on side s (if any) is I. We may thus henceforth assume that 
there is at least one taxon underneath x% on side s. Before discussing the third test, we need some 
observations about the relative location of Xi and / in the input trees. 



Fig. 3 If the unique directed path from the parent of I to the parent of Xi contains an interior node then taxon V 
must exist. But N(l, s) cannot possibly display such a tree. 

Let T be a tree of T. We may assume that there is a directed path in T (possibly of length 0) 
from the parent of I to the parent of Xi. Indeed, if this was not the case we would have that either 
the parent of Xi is an ancestor of the parent of I or these two parent nodes are not comparable i.e. 
neither is related to the other by the ancestor-descendant relation. In both cases T would contain 
a non-singleton cluster containing I but not Xi, but this is impossible since I — >ci(T) x l- Moreover, 
the directed path from the parent of I to the parent of Xi cannot contain any interior nodes. To 
see this, suppose there was some interior node on this path; this would imply the existence of a 
taxon V lying "strictly between" Xi and I (see Figure [3]) • Moreover, since B{1) = 0, we have that V 
is in X(N). But then T cannot possibly be displayed by N(l, s) because N(l, s) leaves no space for 
I' to be in the correct position. This contradicts the assumption that N(l, s) displays all the trees 
in T restricted to X(N) U {I}. Combining these insights we see that there are only two possible 
configurations for T, Cases A and B, depicted in Figure 

Irrespective of whether the tree T is in Case A or Case B we require the following definitions. 
Let p be the parent of x%. Let u\, . . . , Uk (k > 1) be the children of p not equal to X{. (Note that 
k = 1 if T is binary). For each Uj let Wj be the subtree of T rooted at uj (i.e. a sibling subtree 
of Xi), and let X(Wj) be the set of taxa in Wj. Let W be the union of all the X(Wj). Observe 
that in Case B, W C X(N) i.e. all taxa in W have already been allocated. (If this was not so, 
B(l) 7^ and we would not be in this case anyway). Observe also that in Case B all the taxa in W 
are reachable in N by directed paths from the parent of Xi- If this was not so, then N(l, s) would 
not have displayed all the trees in T restricted to X(N) U {1} (specifically: T) and we would not 
be in this case anyway. 

We say a tree T e T is safe w.r.t. s if (i) it is in Case A or (ii) it is in Case B and for each of its 
Wj, at least one taxon from X(Wj) has already been allocated to side s. If a tree is not safe w.r.t. 
s then it is unsafe w.r.t. s: it is in Case B and there exists at least one Wj such that none of the 
taxa in X{Wj) have been allocated to side s. (Combining this fact with the earlier observations 
that all the taxa in the sibling subtrees of Xi have already been allocated and are reachable by 
directed paths from the parent of Xi, we note that in this unsafe situation all the taxa in Wj must 
have been allocated to sides reachable from i.e. "underneath" side s). 

The third test (Algorithm 1, line 15, third sentence) is this: if all trees in T are safe w.r.t. s. 
then return N(l, s). We now argue that this is correct. Suppose, for the sake of contradiction, that 
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Case A Case B 

Fig. 4 Case A is when I and x have the same parent, Case B is when the parent of I is the grandparent of 

all valid completions terminate the side at X{. (As mentioned earlier it is not possible that a taxon 
V other than I is placed immediately above x\). Let N' be an arbitrary valid completion of N. 
Note that, by definition, N' has the same set of side guesses as N. Let Tjv be the switching of N' 
corresponding to a binary refinement of T. Denote by N" and Tjv" respectively the network and 
the switching obtained respectively from N' and T/y by moving I, wherever it is, onto the side s, 
just above Xi. We claim that N" displays all the trees in T. It is not too difficult to see that, if T 
is in Case A, TV" still corresponds to a binary refinement of T. This also holds if T is in Case B 
and for each Wj at least one taxon from X(Wj) is on side s; the central argument for this is that 
in any switching in a valid completion that corresponds to a binary refinement of T, the parent of 
Xi will always be the lowest common ancestor of WU {xi}. Furthermore, we can argue as in [15| 
that because of the assumed minimality of the side guesses, N" has the same set of side guesses 
as N and N', see [TS] for the full argument. Hence we can conclude that N" is a valid completion, 
yielding a contradiction. So if all trees in T are safe w.r.t. s, then returning N(l, s) is correct. 

If we have reached this point then at least one tree in T is unsafe w.r.t. s. The problem we 
face here is that for an unsafe tree T it might hold that in all switchings corresponding to a binary 
refinement of T, ranging over all valid completions, the lowest common ancestor of WU lies 
above side s. In such a switching moving I directly above Xi creates a switching that does not 
correspond to a binary refinement of T, because I will wrongly have been put "inbetween" the 
Wj. Hence, we cannot be certain that only returning N(l,s) is correct, because this might make 
it impossible to reach any valid completions. 

Hence, we guess (Algorithm 1, line 15, fourth sentence). That is, we return both N(l, s) and N 
with the side terminated just above x% . This is correct (because these are the only two possibilities) 
but it causes the search tree to branch. Unfortunately, in a valid completion there might be OflA?|) 
taxa on side s, and in the worst case we might have to branch (because some tree is unsafe) for 
each taxon as it is placed on side s. This could inflate the running time by a factor of 2°" x \\ 
which is in general not f(r).poly(mj^ However, under the assumption of well-boundedness we 
can guarantee a f(r).poly(m) running time, as we will now show. The key to proving this lies in 
proving two observations: 

Observation 1. During the execution of Algorithm 2 (which repeatedly calls Algorithm 1) a tree 

3 Note, however, that if \T\ is exponentially large as a function of \X\, 2°(W becomes poly(m), meaning that 
in such cases f(r).poly(m) running time might still be possible. 
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T can be unsafe w.r.t. s at most once. Specifically, all trees that are unsafe at the moment we 
branch, will never be unsafe again w.r.t. s. Hence, each time we branch at least one tree in T will 
become safe w.r.t. s for the remainder of the execution. If\T\ is bounded by f(r) (condition (1) of 
well-boundedness) then the inflation in the running time caused by branching will thus be limited 
to 2^ r \ which is itself f(r); after this point Algorithm 1 will never branch again. 

We now prove this observation. Suppose that it does not hold, and that some tree T becomes 




Fig. 5 If a tree is unsafe w.r.t. s a second time, and the directed path from the parent of Xi (in T) to the parent 
of V has length greater than zero, then x^ and V belong to the same W p ^ Wj, such that the lea of X(W P ) (in any 
binary refinement of T displayed by the network) is an ancestor of the lea of X(Wj), contradiction. 

unsafe w.r.t. s a second time. Let x\ and V be the corresponding taxa the first time T was unsafe 
w.r.t. s and let Xi and I be the taxa, and N the incomplete network, at the second moment of 
unsafeness, see Figure[SJ Clearly, at the second point of unsafeness x' i: V and x\ are already on side 
s, in that relative order. (We include the possibility that V = xi). Note that in T the parent of I 
is a strict ancestor of the parent of x%, and the parent of V is a strict ancestor of the parent of x\. 
(This follows because unsafeness implies Case B). Furthermore, the fact that N(l,s) displays all 
trees restricted to X(N) U {1} means that in T there is a directed path (possibly of length 0) from 
the parent of x% to the parent of V . Whichever holds, the parents of I and x% are strict ancestors 
of the parent of x[ in T. Now, given that T is unsafe w.r.t. s for a second time, at least one of the 
Wj corresponding to Xi (i.e. the sibling subtrees of Xi in T) is such that all the taxa in X(Wj) 
lie strictly underneath side s. Also, we know that N(l, s) displays T restricted to X(N) U {I}. 
Consider any binary refinement of T restricted to X(N) U {/} displayed by N(l,s), and observe 
that in such a refinement the parent of x\ lies on the directed path from the parent of x% to the 
lowest common ancestor of X(Wj). (This directed path must exist because of the location of I just 
above Xi in N(l,s)). Clearly, neither x\ nor I is in X(Wj). Recall that (by the definition of Case 
B) in T the grandparent of x' t is the same node as the parent of V . So, if the directed path from 
the parent of Xi (in T) to the parent of V has length greater than zero, then x\ and V belong to 
the same W p ^ Wj of X{. See Figure [5] 

But this cannot be so, because in the binary refinement the lowest common ancestor of Wj 
would be an ancestor of the lowest common ancestor of W v , which is not allowed. (It is not 
allowed because, whichever binary refinement we choose, all the sibling subtrees of Xi should be 
incomparable). So suppose that the parent of V is the same as the parent of Xi in T. In this case 
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I' and x\ cannot belong to the same sibling subtree of Xi, because (by Case B) the parent of x\ in 
T is not the parent of /'. Suppose x\ is in W q ^ Wj. Then ^(Wg)! > 2, otherwise it would not be 
possible for the pair x\, I' to cause the unsafeness of side s in the previous iteration. But then we 
have that, in the refinement, the lowest common ancestor of W q lies on the edge-side s itself i.e. 
is an ancestor of the lowest common ancestor of Wj (see Figure |6]), which is again not allowed. 
(The reason is the same as before: in any binary refinement the lowest common ancestors of all the 
sibling subtrees of Xi should be mutually incomparable). From this we conclude that Wj cannot 
lie entirely underneath side s. Hence some taxon of Wj is already on side s. Hence T is safe w.r.t. 
s. Hence T could not become unsafe w.r.t. s for a second time. 



Fig. 6 If a tree is unsafe w.r.t. s a second time, and the directed path from the parent of x; (in T) to the parent 
of V has zero length, then x', is in some W q ^ Wj with at least one other taxon, meaning that the lea of X(W q ) 
(in any binary refinement of T displayed by the network) is an ancestor of the lea of X(Wj), contradiction. 



So, we have shown that an f(r) bound on |T| is indeed sufficient to get the running time we 
need. We now introduce the second key observation that concerns the case when node degrees are 
bounded by f(r): 

Observation 2. Let d be the maximum node degree ranging over all nodes in all trees in T. 
Then, after at most d+1 branchings, all trees in T will have become safe w.r.t. 43. 

Suppose then that we have already branched d + 1 times. This means that there are already 
d+1 taxa at the bottom of side s. Now, suppose a tree T is unsafe w.r.t. s, so some Wj lies 
entirely underneath side s. We know that N(l, s) displays T restricted to X(N) U {I}, from which 
we can conclude that all the taxa on side s below Xi belong to (possibly different) sibling subtrees 
of Xi in T. Observe that it cannot happen that two or more of the taxa below x% on side s belong 
to the same sibling subtree W p . This holds because it would mean that in any binary refinement 
(displayed by a valid completion of N(l, s)) the lowest common ancestor of W p is an ancestor of 
Wj, and this is not allowed because they should be incomparable. So the only way T can be unsafe 
is if every taxon on s below n is in a different sibling subtree. But, because of the degree bound, 
there can only be at most d different sibling subtrees, so this is not possible. Hence T is safe, 
contradiction. 



4 We note that the definition of d, and the choice of d+ 1, is not particularly well-optimized. In particular, in 
the case (7 = branching can only happen if there are already 2 or more taxa allocated to side s\ the placement of 
the first two taxa is deterministic (thanks to the hard assumption that s is a long side). An interesting consequence 
of this is that, if T is binary, the (7 = phase of the algorithm is entirely deterministic, because after placing the 
initial two taxa on the side no tree in T can subsequently ever be unsafe w.r.t. s. 




X(Wq) 



lea of 
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This concludes the proof that the U = case terminates after at most f(r).poly(m) iterations, 
assuming well-boundedness. 

Case U 0. With the exception of the subcase encountered when the original line 53 is reached 
- i.e. when simultaneously \L'\ = 1, B(l) = 0, N(l, s) does represent C restricted to X(N) U {1} and 
N*(l, s) does represent C* - all cases can be proven as argued in Lemma 3 of [15] ■ In this subcase 
the new line 53 of Algorithm 1 applies, and we now prove its correctness. 

In this remaining subcase there are (at most) three possibilities. (1) The side terminate^] at 
Xi\ (2) I is the taxon immediately above Xi on side s; (3) / is on some side in U. Observe that this 
really covers all cases. If the side does not terminate at Xi, I is not in U and I is not immediately 
above x%, then some not yet allocated taxon p =/= I must be immediately above X{. But then we 
would have p^ci(T) x i ( so P € L) and l—*ci(T)Pi so ' ^ L', contradiction. 

We now prove that the sequence of steps shown in the new line 53 is correct. Firstly, suppose 
N*(l,s) does not display each tree in T restricted to X(N) U {I}. Then (2) is excluded as a 
possibility. So in this case we guess (1) or (3) i.e. guess either to end the side or to put I somewhere 
in U. Note that each time (1) or (3) is guessed an /(r)-counter is decremented. This is because the 
number of sides is /(r)-bounded and either s is declared as finished or a short side of U is filled. 

We may henceforth assume that TV* (7, s) does display each tree in T restricted to X(N) U {I}. 
We observe that if we have reached this point then every tree in T will be in Case A or Case 
B; the proof of this given in the case U = goes through here too. (The two comments about 
trees in Case B that follow this proof also still hold). The notion of safe and unsafe is hence still 
well-defined. In fact, the proof that - when all trees are safe - it is legitimate to simply return 
N(l,s) also holds. So the only situation left to consider is when at least one tree is unsafe. As 
argued above there are only three possibilities for action and in line 53 we consider all of them. 
From this we conclude that the algorithm is correct. However, it is still necessary to bound the 
running time. 

The only "dangerous" guess is (2) because unlike (1) and (3) this does not obviously decre- 
ment any /(r)-bounded counters. To show that this still leads us to a f(r) ■ poly(m) running time 
we will prove that it can happen at most once that a tree is unsafe and we subsequently guess 
(2). The proof is unchanged from U = 0. The only difference, and the reason that we emphasize 
the and, is that a tree might be unsafe but (rather than putting I above Xi) we decide to put I 
in U, meaning that the same tree can still be unsafe again in a later iteration. However, due to 
the /(r)-bound on the number of sides this cannot happen too often (and the case {7 = will 
be reached). Combining these insights shows that if there are at most f(r) trees in T then we 
will reach the last part of line 53 at most f(r) times in total (during the construction of side s). 
Alternatively, if the maximum degree of trees in 7~ is bounded by f(r), then (just as in the case 
U = 0) we can argue that if d+ 1 taxa have already been placed on side s, and we have survived the 
first check in line 53, then all the trees in T will be safe w.r.t. s and it is fine to only return N(l, s). 

This concludes the proof of the lemma. □ 



Algorithm 2: completeSide*(iV, s) 

Af <- N; 

while there exists N £ Af such that s is not finished in N do 
Af ^Af\N; 

N <- N U addOnSide*(7V, s); 



5 Recall that, if Xi is the only taxon currently allocated to side s then this possibility is excluded, because it 
violates the assumption that s is a long side. 
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Lemma 6 Let T be a well-bounded ST-collapsed set of trees on X and let r be the first integer 
such that a network with reticulation number r displaying all the trees in T exists. Let N be an 
incomplete network such that its underlying r -reticulation generator G and set of side guesses Sg 
are such that (G,Sa) is side-minimal w.r.t. T and r, and let s be an active side of N . Then, 
if a valid completion for N exists, Algorithm^ computes a set of (incomplete) networks TV such 
that this set contains at least one network for which a valid completion exists for which s is a 
finished side in time f(r) -poly(m), where m = \X\ + |T|. 

Proof. Algorithm [5] is the same as Algorithm 2 of [TS] , but for the fact that in the former the 
subroutine addOnSide* (defined in Algorithm [lj is called, rather than the subroutine addOnSidc 
(defined in Algorithm 1 of |15j). From this observation and by Lemma [SJ which extends Lemma 3 
of [TS] , the proof of Lemma 4 of [TS] can be easily adapted to prove this lemma. □ 



Algorithm 3: ComputeNetwork*(T) 

i foreach r-reticulation generator G in increasing side order do 
7 Af' <— completeSide*(A r (s~ , s), s) 
16 if there is a network N' S AT' displaying all the trees in T then 



Lemma 7 Let T be a well-bounded, ST-collapsed set of trees on X . Then, for every fixed r > 0, 
Algorithm^ determines whether a network N such that r(N) = r displaying every tree in T exists, 
and if so, returns it in time f(r) ■ poly(m), where m — \ X\ + \T\ ■ 

Proof. Algorithm [3] coincides with Algorithm 3 of [TS] but for lines 1, 7 and 16. Thus, we only 
detail the modified lines in the pseudocode. Since by Lemma |H we can narrow the search to the 
set M r x , by Lemma [6] we can use the same proof scheme as Lemma 5 of [15] to prove the lemma - 
the check of line 16 can be done in time /(r) ■ poly(m) by Proposition [T] □ 

From Lemmas [1] [3] and [71 we finally conclude the following: 

Theorem 1 Let T be a well-bounded set of trees on X. Then, for every fixed r > ; it is possible 
to determine in time f(r) -poly(m), where m = \X\ + \T\, whether a network that displays all the 
trees in T with reticulation number at most r exists (and if so, to return such a network). 

4 Minimizing the reticulation number of a set of trees is polynomial-time solvable 
for a fixed number of reticulations 

For completeness we show that, even though we do not yet have an FPT result for an arbitrary 
set of nonbinary trees on the same set of taxa X, we do have the following weaker result which 
shows that for a fixed number of reticulations the problem is polynomial-time solveable. This is 
strictly more general than the (implied) polynomial-time results in |26j and [18] due to the fact 
that here it is permitted to have an unbounded number of non-binary trees in the input. 

Theorem 2 Let T be a set of trees on X . Then, for every fixed r > 0, it is possible to determine 
in polynomial time - specifically, time 0{m^ r ^), where m = \X\ + |7~| - whether a network that 
displays all the trees in T with reticulation number at most r exists ( and if so, to return such a 
network). 

Proof. The proof is a straightforward extension of Theorem 1 from |16j adapted to use r-reticulation 
generators. We sketch the construction here. Without loss of generality we can assume that T is 
ST-collapsed and that we can restrict our attention to adding taxa to r-reticulation generators. 
We can also assume without loss of generality that r(T) > r. Let N be a network that displays all 



14 



Steven Kelk, Celine Scornavacca 



the trees in T, such that r(N) = r(T). We begin by guessing the correct r-reticulation generator 
for iV; there are at most /(r) such generators, and an r-reticulation generator has at most 0(r) 
sides. For each side of the generator we guess whether it has 0,1,2 or > 2 taxa on it. For sides 
with 1 taxon we guess the identity of that taxon. For sides s with > 2 taxa we guess the identity 
of the taxon nearest the root on that side, s + , and the taxon furthest from the root on that side, 
s~ . We say that a > 2 side is lowest if it does not yet have all its taxa and there is no other 
> 2 side s' ^ s with this property that is reachable by a directed path from the head of s. The 
algorithm chooses a lowest side and adds all its taxa to it, repeating this until there are no more 
lowest sides (i.e. until all \X\ taxa have been added to the network). When the sides are processed 
in this order, a taxon x belongs on side s if and only if there is a cluster C G Cl(T) such that x 
and s~ are both in C, but s + is not. Now, once all the taxa for a > 2 side s have been identified, 
their order on that side is uniquely identified. This follows because the — >ci(T) relation imposes a 
total order s + — >ci(T) ■ ■ ■ ~~*Cl(T) s ~ 011 the taxa. (Note that there cannot be cycles in the — >ci(T) 
relation because the taxa in the cycle would then induce an ST-set, contradicting the assumption 
that T is ST-collapsed [T51 Proposition 2 (extended)]). Finally, having added all the taxa to the 
r-reticulation generator we can check in time f(r).poly(m) whether it displays all the trees in T. 
In this way we can identify N in polynomial-time. □ 



5 Conclusions and future directions 

In this article we have described quite broad sufficient conditions under which the computation of 
reticulation number of a set T of trees is FPT. This extends existing FPT results, which applied 
to two specific cases: (i) two nonbinary trees, and (ii) an arbitrarily large set of binary trees. 
The obvious open question that remains is whether computation of reticulation number is still 
FPT when the "well-bounded" condition is lifted i.e. when there are an unbounded number of 
nonbinary trees in the input with unbounded maximum degree. We note already the following 
three observations, which seem to be important in this regard. Primo, if \T\ > 2 r and r(T) < r 
then two or more trees in T must have a common binary refinement (because a network with 
r reticulations can display at most 2 r distinct trees). Secundo, a r-reticulation generator has at 
most 5r sides i.e. r node sides and at most 4r edge sides [T3] . Hence, if a tree T £ T has a node u 
such that u has more than 5r children, then at least two of the taxa reachable by directed paths 
in T from u, must be on the same edge-side of the underlying r-reticulation generator in any valid 
completion. In |26j (i.e. when T is binary) a similar argument is used to develop a kernelization 
strategy in which common chains have length at most /(r). Specifically, chains with length longer 
than 5r must have at least two taxa allocated to the same edge side, from which can immediately 
be concluded that the entire common chain can be safely allocated to that side. Unfortunately, 
in the case of multiple nonbinary trees it is still not entirely clear how common chains should be 
defined and utilized. In particular, it is not clear how to generalise the definition given in |18j for 
two trees, to the case of multiple trees, such that an FPT running time is obtained even when 
the number of trees in the input is unbounded. Tertio, there has so far been little attention for the 
possibility that the problem is not FPT. The standard way of proving non-FPT is to show that a 
problem is VF[l]-hard [21,8;. It would be interesting to explore this possibility further, which would 
require developing FPT-reductions from (for example) independent set or maximum clique. Such 
problems have not yet figured prominently in the phylogenetic network literature, which makes 
this an interesting research direction in its own right. 
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7 Appendix 

Here we discuss some technical points about (bounds on) the size of the input. The first is a loose 
upper bound on the number of trees in the input. 

Observation 3. Let N be a binary network on X with r reticulations. If |T| > 2 r • 2 2 d A 'l~ 1 ) 
then N cannot display all the trees in T ■ 

Proof. N can display up to 2 r binary trees on X. By contracting all possible subsets of the edges 
of a binary tree T, we generate all possible nonbinary trees on X of which T is a binary refinement 
(and some non-valid trees too). There are 2(\X\ — 1) edges in a binary tree, from which the claim 
follows. □ 

The following toy construction shows how |7~| can become very large as a function of both r 
and \X\ = n, without introducing any obvious redundancy in the input. In particular it shows 
that even if we assume that T is ST-collapsed and that no tree in T is a refinement of another, 
there exist T for which |7~| grows exponentially quickly in both r and n. 

Choose r and then choose n sufficiently large with respect to r (we will explain later how to 




Fig. 7 A binary network N with r reticulations and X = {x\, . . . , x r , x r +%, . . . x n }, where the first r taxa are on 
node-sides and the remaining n — r taxa are all on the same edge side. 

do this). Without loss of generality we assume that n — r — 1 is odd. Consider the network N 
shown in Figure [7J Now, construct a set of trees T on X as follows. Let B be the set of 2 r binary 
trees displayed by N. For each tree T" £ B we will add a set of nonbinary trees NB(T') to T, 
although we will not add T" itself. Consider the n — r taxa on the top-left edge side of N, and the 
n — r — 1 edges that have as endpoints two parents of these taxa. We call these chain edges. The 
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set NB(T') consists of all trees obtained from T" by contracting exactly [ - | 1 ] of these edges. 
Hence, 

\r\-2 r ( n ~ r ' 1 ) 

which is exponential in both r and n. We first show that there are no trees T\,Ti £ T such that 
one is a refinement of the other. To see this, note that if T\ and T2 are in different NB(.) sets, then 
T\ and T2 cannot be refinements of each other because of the distribution of the taxa x\,...,x r 
in the trees. Suppose then that T\ and T2 both come from the same NB(.) set. Because of the 
way we have contracted edges, T\ will have some chain edge that has been contracted in T2, and 
vice-versa, proving that neither Cl{T\) C ClifT-j) nor ClifT^) C C7(Ti), from which we conclude 
that neither tree is a refinement of the other. Secondly, it can be verified that T is ST-collapscd, 
but we omit the proof. 

It remains only to show that r(T) = r. To argue this we note that any network with r 
reticulations can represent at most 2 r (2(n — 1) — n) + n = 2 r (n — 2) + n clusters. This is the usual 
cluster bound (see the discussion after Proposition [1]) but slightly tightened so that singleton 
clusters are only counted once. Hence, if we can show that \Cl(T)\ > 2 r ~ 1 (n — 2) + n then we can 
immediately conclude that r(T) = r. (Note that r(T) < r holds because every tree in T g T is 
obtained by contracting edges of one of the binary trees T" displayed by TV i.e. T" is a refinement 
of T). We claim that \Cl{T)\ > 2 r (n - r - 1). This holds because, for every tree T <E B and 
every chain edge e, there exists some tree in T g T such that T G NB(T') and the edge e is not 
contracted in T. To conclude, we only have to choose n sufficiently large that 

2 r (n-r-l) > 2 r -\n-2) + n. 

Lemma [2j Given a set of trees T on X, let N be a network displaying all the trees in T ■ Let S 
be a non-singleton ST-set with respect to Cl{T). Then there exists a network TV' displaying all the 
trees in T such that r(N') < r(N), S is under a cut-edge in N' and for each ST-set S' such that 
S' n S = and S' is under a cut-edge in N, S' is also under a cut-edge in N' . 

Proof. This lemma follows from the proof of Lemma 11 of |16j . where we described how to trans- 
form N into a network N' such that r(N') < r(N) S is under a cut-edge in N' and for each ST-set 
S' such that S' f) S = and S' is under a cut-edge in N, S' is also under a cut-edge in N'. For the 
sake of completeness, we report here the transformation. We obtain TV' from TV by the following 
transformation. Let x be any element of S and let v be the node of TV labeled by x. (a) Delete in N 
all taxa in S (but not the leaves they label, we will deal with this in step (c)). (b) Identify v with 
the root of an arbitrary binary tree T5 on S that represents Cl(l~)\S. (c) Tidy up redundant parts 
of the network possibly created in step (a) by applying in an arbitrary order any of the following 
steps until no more can be applied: deleting any nodes with outdegree-0 that arc not labelled by 
a taxon; suppressing any nodes with indegree-1 and outdegree-1; replacing any multi-edges with 
a single edge; deleting any node with indegree-0 and outdegree-1. 

To prove Lemma^l we still have to show that TV' still displays each tree in T '. Let T be a tree of 
T and let Tjy be the switching of TV corresponding to some refinement of T. In the proof of Lemma 
11 of [16], we proved that Tjv is modified by the transformation to become a switching Tm< of TV' 
such that T/v< represent^! all clusters C of TV such that SdC = or S C C. Since S is compatible 
with C, the only other clusters of Tjv are clusters C such that CCS. Note that by construction 
T N , displays T s . This implies that T N > displays the set of clusters Cl{T N ) \ (Cl(T N )\S) U Cl{T s ). 
Note that by construction, Cl{T) \ (Cl(T N )\S)U Cl(T)\S = Cl(T). Then, since Cl{T s ) 2 Cl(T)\S 
and Cl(T N ) D Cl(T), it follows that Cl(T N >) D Cl(T). So T N > is a corresponds to a refinement of 
T and this concludes the proof. □ 



6 Although a switching is not, formally speaking, a phylogenetic tree - because it can have nodes with indegrec 
and outdegree both equal to one, and possibly a redundant node with indegrec and outdegree 1 - the definitions 
of "represents" and "displays" still hold and behave as expected. 



